Differentiating Homonymy and Polysemy in Information Retrieval
نویسنده
چکیده
Recent studies into Web retrieval have shown that word sense disambiguation can increase retrieval effectiveness. However, it remains unclear as to the minimum disambiguation accuracy required and the granularity with which one must define word sense in order to maximize these benefits. This study answers these questions using a simulation of the effects of ambiguity on information retrieval. It goes beyond previous studies by differentiating between homonymy and polysemy. Results show that retrieval is more sensitive to polysemy than homonymy and that, when resolving polysemy, accuracy as low as 55% can potentially lead to increased performance.
منابع مشابه
Homonymy and Polysemy in Information Retrieval
This paper discusses research on distinguishing word meanings in the context of information retrieval systems. We conducted experiments with three sources of evidence for making these distinctions: morphology, part-of-speech, and phrases. We have focused on the distinction between homonymy and polysemy (unrelated vs. related meanings). Our results support the need to distinguish homonymy and p ...
متن کاملOntology-based Distinction between Polysemy and Homonymy
We consider the problem of distinguishing polysemous from homonymous nouns. This distinction is often taken for granted, but is seldom operationalized in the shape of an empirical model. We present a first step towards such a model, based on WordNet augmented with ontological classes provided by CoreLex. This model provides a polysemy index for each noun which (a), accurately distinguishes betw...
متن کاملHomonymy and Polysemy in the Czech Morphological Dictionary
We focus on a problem of homonymy and polysemy in morphological dictionaries on the example of the Czech morphological dictionary MorfFlex CZ [2]. It is not necessary to distinguish meanings in morphological dictionaries unless the distinction has consequencies in word formation or syntax. The contribution proposes several important rules and principles for achieving consistency.
متن کاملبررسی مشکلات جستوجو و بازیابی اطلاعات در پایگاههای اطلاعاتی از جنبه ویژگیهای نگارشی زبان فارسی
The present research was carried out with the aim of explicating the major writing and semantic problems of Persian language when using data environments and determining the degree of compatibility and attention to these features in Persian databases. This research is of survey analytical type being conducted through direct observation. Having reviewed the related literature, we kept a checkli...
متن کاملLexical ambiguity in L2: Homonymy and polysemy among Polish-English bilinguals
The M.A. study reported here aimed at examining the processing of isolated lexically ambiguous words in L2 by fluent Polish-English bilinguals. Just as many people speak more then one language (Cook 2002: 22), most words prove to have more than one meaning or sense (Rodd er al. 2004: 90). Consequently, it appears crucial to add the ambiguity factor to the analysis of the lexicon. Indeed, a vari...
متن کامل